Deep Speaker Feature Learning for Text-independent Speaker Verification

Abstract

Recently, deep neural networks (DNNs) have been used to learn speaker features. However, the quality of the learned features is not sufficiently good, so a complex back-end model, either neural or probabilistic, has to be used to address the residual uncertainty when applied to speaker verification, just as with raw features. This paper presents a convolutional time-delay deep neural network structure (CT-DNN) for speaker feature learning. Our experimental results on the Fisher database demonstrated that this CT-DNN can produce high-quality speaker features: even with a single feature (0.3 seconds including the context), the EER can be as low as 7.68%. This effectively confirmed that the speaker trait is largely a deterministic short-time property rather than a long-time distributional pattern, and therefore can be extracted from just dozens of frames.

Bibliographic details

Authors: Li, Lantian; Chen, Yixiang; Shi, Ying; Tang, Zhiyuan; Wang, Dong
Year: 2017
Original format: PDF
